Computational identification of functional introns: high positional conservation of introns that harbor RNA genes
Abstract
An appreciable fraction of introns is thought to have some function, but there is no obvious way to predict which specific introns are likely to be functional. We hypothesize that functional introns experience a different selection regime than non-functional ones and will therefore show distinct evolutionary histories. In particular, we expect functional introns to be more resistant to loss, and we expect this resistance to be reflected in high conservation of their position with respect to the coding sequence. To test this hypothesis, we focused on introns whose function derives from microRNAs and snoRNAs embedded within their sequence. We built a data set of orthologous genes across 28 eukaryotic species, reconstructed the evolutionary histories of their introns and compared functional introns with the remaining introns. We found that the positions of microRNA- and snoRNA-bearing introns are indeed significantly more conserved. In addition, we found that both families of RNA genes settled within introns early in metazoan evolution. We identified several easily computable intronic properties that can be used to detect functional introns in general, thereby suggesting a new strategy for pinpointing non-coding cellular functions.
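The comparison described in the abstract can be illustrated with a deliberately simplified Python sketch. It is not the authors' method: the abstract states that intron evolutionary histories were reconstructed, whereas this sketch swaps in a much cruder proxy, the fraction of species in which an intron position is occupied, and compares the two groups with a one-sided permutation test. The intron names, group labels, presence/absence vectors and function names are all hypothetical and exist only for illustration.

```python
import random
from statistics import mean


def conservation(presence):
    """Fraction of species in which the intron position is occupied.

    `presence` holds 1 (intron present), 0 (absent) or None (no ortholog
    or no alignment at that position) for each species.
    """
    known = [p for p in presence if p is not None]
    return sum(known) / len(known) if known else 0.0


def permutation_test(group_a, group_b, n_perm=10000, seed=0):
    """One-sided permutation test on the difference of group means.

    Returns (observed difference, p-value) for the alternative
    mean(group_a) > mean(group_b).
    """
    rng = random.Random(seed)
    observed = mean(group_a) - mean(group_b)
    pooled = list(group_a) + list(group_b)
    k = len(group_a)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        if mean(pooled[:k]) - mean(pooled[k:]) >= observed:
            hits += 1
    # Add-one correction so the estimated p-value is never exactly zero.
    return observed, (hits + 1) / (n_perm + 1)


# Hypothetical toy data: one presence/absence vector per intron position,
# with one entry per species (six species here, not the 28 of the study).
introns = {
    "geneA_intron3": ("rna_bearing", [1, 1, 1, 1, 1, None]),
    "geneB_intron1": ("rna_bearing", [1, 1, 1, 1, 0, 1]),
    "geneC_intron5": ("rna_bearing", [1, 1, 1, 1, 1, 1]),
    "geneD_intron2": ("other", [1, 0, 0, 1, 0, 0]),
    "geneE_intron4": ("other", [0, 1, 0, 0, None, 0]),
    "geneF_intron1": ("other", [1, 1, 0, 0, 0, 1]),
    "geneG_intron7": ("other", [0, 0, 1, 0, 0, 0]),
}

rna_scores = [conservation(v) for g, v in introns.values() if g == "rna_bearing"]
other_scores = [conservation(v) for g, v in introns.values() if g == "other"]

observed, p_value = permutation_test(rna_scores, other_scores)
print(f"mean difference = {observed:.3f}, permutation p = {p_value:.4f}")
```

A permutation test is used here only because it makes no distributional assumption about the conservation scores; with real data, a phylogeny-aware reconstruction of intron gain and loss would be needed to separate true conservation from shared ancestry.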
Similar resources
The Function of Introns
The intron-exon architecture of many eukaryotic genes raises the intriguing question of whether this unique organization serves any function or is simply a result of the spread of functionless introns in eukaryotic genomes. In this review, we show that introns in contemporary species fulfill a broad spectrum of functions, and are involved in virtually every step of mRNA processing. We propo...
Comparative bioinformatics analysis of a wild diploid Gossypium with two cultivated allotetraploid species
Background: Gossypium thurberi is a wild diploid species that has been used to improve cultivated allotetraploid cotton. G. thurberi belongs to the D genome, which is an important wild bio-source for cotton breeding and genetic research. To a certain degree, chloroplast DNA sequence information is a versatile tool for species identification and phylogenetic inference in plants. Different ch...
Introns regulate RNA and protein abundance in yeast.
The purpose of introns in the architecturally simple genome of Saccharomyces cerevisiae is not well understood. To assay the functional relevance of introns, a series of computational analyses and several detailed deletion studies were completed on the intronic genes of S. cerevisiae. Mining existing data from genomewide studies on yeast revealed that intron-containing genes produce more RNA an...
Novel Intronic RNA Structures Contribute to Maintenance of Phenotype in Saccharomyces cerevisiae
The Saccharomyces cerevisiae genome has undergone extensive intron loss during its evolutionary history. It has been suggested that the few remaining introns (in only 5% of protein-coding genes) are retained because of their impact on function under stress conditions. Here, we explore the possibility that novel noncoding RNA structures (ncRNAs) are embedded within intronic sequences and are con...
Sequential splicing of a group II twintron in the marine cyanobacterium Trichodesmium
The marine cyanobacterium Trichodesmium is unusual in its genomic architecture as 40% of the genome is occupied by non-coding DNA. Although the majority of it is transcribed into RNA, it is not well understood why such a large non-coding genome fraction is maintained. Mobile genetic elements can contribute to genome expansion. Many bacteria harbor introns whereas twintrons, introns-in-introns, ...